Improving incremental signature-based Grobner 

basis algorithms 

Christian Eder 
c/o Department of Mathematics 

TU Kaiserslautern 
67653 Kaiserslautern, Germany 
eder c @mat hemat ik.uni-kl.de 

March 27, 2012 

Abstract 

In this paper we describe a combination of ideas to improve incre- 
mental signature-based Grobner basis algorithms having a big impact on 
their performance. Besides explaining how to combine already known op- 
timizations to achieve more efficient algorithms, we show how to improve 
them even more. Although our idea has a psotive affect on all kinds of 
incremental signature-based algorithms, the way this impact is achieved 
can be quite different. Based on the two best-known algorithms in this 
area, F5 and G2V, we explain our idea, both from a theoretical and a 
practical point of view. 

1 Introduction 

Computing Grobner bases is a fundamental tool in commutative and computer 
algebra. Buchberger introduced the first algorithm to compute such bases in 
1965, see [2J. In the meantime lots of additional and improved algorithms have 
been developed. 

In the last couple of years, so-called signature-based algorithms like F5, 
see [7], and G2V, see [5], have become more popular. Lots of optimizations 
for these algorithms have been published, for example, see [TIIBIIWTI]. Whereas 
the last two publications focus on the field of non-incremental signature-based 
algorithms, we focus our discussion in this paper to the initial presentation of 
this kind of algorithms, based on Faugere's initial presentation of F5 in [7]: 
Computing Grobner bases step by step iterating over the generators of the in- 
put system. The intermediate states of this incremental structure can be used 
to improve performance. 

The intention of this paper is not only to cover, to collect, and to compare 
the various optimizations found recently, but also to increase the algorithms' 
efficiency. As discussed in-depth in [B] , signature-based algorithms differ mainly 
on their implementation of two criteria used to detect useless critical pairs dur- 
ing the computations, the non-minimal signature criterion and the rewriting 
signature criterion; the optimizations presented in this publication have mostly 
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an impact on the first one criterion. We focus our discussion on the two best- 
known and most efficient incremental algorithms in this area, namely F5 and 
G2V. Due to their different, in some sense even opposed, usages of the above 
mentioned criteria, their behaviour under our optimizations gives a rather ac- 
curate picture of the general impact on the class of incremental signature-based 
algorithms. 

In Section [5] we introduce the basic notions of incremental signature-based 
algorithms. In [5] the idea of interreducing intermediate Grobncr bases between 
the iteration steps of F5 is illustrated: Speed-ups of up to 30% comparing F5C 
to the basic F5 can be achieved by minimizing the computational overhead 
generated due to the inner workings of signature-based algorithms. Section [3] 
shortly reviews this idea, from a more general point of view than it was done 
in its initial presentation back then, taking its effects on algorithms like G2V 
into account. G2V and the idea of using zero reductions actively in the current 
iteration step is content of Section^ The idea of using recent zero reductions in 
the algorithm goes back to Alberto Arri's preprint of [1] in 2009, where such a 
kind of optimization was mentioned for the first time. Combining these two, at a 
first look quite separated improvements in a clever way is the main contribution 
of this paper: We see that a quite easy idea can be used to get a faster detection 
of useless critical pairs; in the situation of G2V one even discards more elements, 
which leads to a huge improvement in the overall performance of the algorithm. 

2 Basic setting 

We start with some basic notations. Let i G N, AC a field, and TZ = K.[x\, . . . , x n ]. 
Let Fj = (/i, where each fj G TZ, and 7j = {Fi) C TZ is the ideal 

generated by the elements of Fj. Moreover, we fix a degree- compatible ordering 
< on the monoid M. of monomials of x±, . . . , x n . For a polynomial p G TZ, we 
denote p's leading monomial by lm (p), its leading coefficient by lc (p), and write 
It (p) = lc (p) lm (p) for its leading term. For any two polynomials p, q G TZ we 
use the shorthand notation 

r(p, q) = 1cm (lm(p), lm(g)) 

for the least common multiple of their leading monomials. 

Let ei, . . . , e m be the canonical generators of the free TvL-module lZ m . We 
extend the ordering < to a well-ordering -< on the set {tej \ t G M, 1 < j < to} 
in the following wajQ: tjej -< tue-u iff j < k, or j = k and tj < tk- We define 
maps 

7T : TZ 1 -> I t 

y.'j J'j'j ^ EJ=iPj/ji 

where pj is a polynomial in TZ for 1 < j < i. An element uj G TZ % with tv(oj) = is 
called a syzygy of /i, . . . , fi. The module of all such syzygies is denoted Syz(Fi). 
A syzygy of type fi-ej — fje-u is called a principal syzygy. The submodule of all 
principal syzygies of Fi is denoted PSyz(Fj) C Syz(F). Note that if a sequence 
Fi of polynomials is regular, then PSyz(F) = Syz(F). 

1 Note that this differs slightfy from the ordering given in [7], but our discussion is mainly 
based on [6]. Moreover, it simplifies notation. 
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Let /i+i G H\Ii. We describe algorithms that, given a Grobner basis Gi 
of Ii, computes a Grobner basis of Tj+i = (Fj+i), where -Fj+i = (fi, . . . , fi+i)- 
Thus we restrict ourselves to an incremental approach in this paper. 

In [5] the class of signature-based Grobner basis algorithms is introduced. 
Those give a new point of view on the computations taking so-called signatures 
into account. 

Definition 2.1. Let p G h+i, j 6 N with j < i + 1, and . . . ,hj G 72. such 
that /ij ^ and 

p = fti/i H fej/j. 

1. If t = lm (hj), we say that tej is a signature of p. Let 5 be the set of all 
signatures: 

S = {tej | t eM, j = + 

2. Using the well-ordering ^ on 5 we can identify for each p G 1Z a unique, 
minimal signature. 

3. For any monomial u G M. and any signature tej G 5 we define a multipli- 
cation u ■ tej :— (ut)ej. 

4. An element / = (tej,p) G S x 7j_|_i is called a labeled polynomial. For a la- 
beled polynomial / = (tej,p) we define the shorthand notations poly(/) = 
p, sig(/) = tej, and index(/) = j. Talking about the leading monomial, 
leading term, and leading coefficient of a labeled polynomial / we always 
assume the corresponding value of poly(/). In the same sense we define 
the least common multiple of two labeled polynomials / and g, t(/, g), by 
r (poly(/), poly(g)). Furthermore, if G = {gi, ...,j ( }c5x then we 
define 

poly(G) := {poly(ffi), . . . , poly (5*)} C 

5. A critical pair of two labeled polynomials f and g is a tuple (/, g) G 
(S x h+if. 

6. Moreover, we define the s-polynomial of two labeled polynomials f and g 

by 

spoly(/, g) = (u, u f ■ poly(/) - |^|% ' P ol y(.9) 
where 

(^) «/ = T M> u ° = T M e M > and 

(b) cj = ma,x{ufsig(f),u g sig(g)}. 

Adopting the notions of reduction and standard representation from the pure 
polynomial setting we get: 

Definition 2.2. Let /, g G 5x7^1 be labeled polynomials, and let G C Sxl i+ i, 
with #G = I. 

1. / reduces sig-safe to g modulo G if there exist sequences ji,---,je G N, 
t\, . . . , ti G M., ci, . . . , ce G 7C, and ro, • ■ • ,r# € S x Zj+i such that for all 
ie {!,...,£} g n G G, 
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(a) r = f,n = Ti-i - CiUg^, n = g, 

(b) lm(ri) < lm(rj_i), and 

(c) t i sig{g ji ) -< sig(r. t _i). 

2. We say that / has a standard representation with respect to G if there 
exist hi, . . . , hi 6 1Z, gi, . . . , ge € G such that 

(a) poly(/) = hi poly(gi) + h he poly(^), 

(b) for each k = 1, . . . , £ either hk = 0, or 

i. lm {hk) lm {g k ) < lm (/), and 

ii. lm {h k ) sig(fffc) r< sig(/). 

Remark 2.3. 

1. If / reduces sig-safe to g modulo G, then it has a standard representation 
modulo G. Moreover, note that the concept of sig-safeness, that means 
the restriction of the reducer gj i by £»sig(<7j 4 ) -< sig(r;_i) in each step, 
is essential for the correctness (and the performance) of signature-based 
algorithms. 

2. In Fact 24 of [S] it is shown that it is sufficient to consider signatures with 
coefficient 1. Thus there is no need to consider \c(hj) in Definition 12.11 
resp. terms in general for signatures. 

The following statement is the signature-based counterpart of Buchbcrger's 
Criterion, see 0. 

Theorem 2.4. Let G i+1 = {gi, . . . , ge] C S X such that {/i, . . . , C 
poly(Gj+i). If for each pair {j,k) with j > k, 1 < j, k < £, spo\y{gj,gk) has 
a standard representation w.r.t. G^+i, then poly(Gi+i) is a Grobner basis of 

Ii+1 ■ 

Proof. For example, see □ 

In the non-signature-based setting, an algorithm based on the Buchberger 
Criterion only would be quite inefficient. There the Product Criterion and the 
Chain Criterion, see are used to reduce useless computations; a notable 

implementation can be found in [lOj . On the signature-based side the very same 
holds: We need criteria to improve the computations, see [5] for more details on 
this topic. 

In the following we assume an incremental signature-based Grobner basis 
algorithm, for example, the reader can think of F5's presentation in [7], G2V as 
stated in [5] , or the more general discussion in [5] . 

Next we state the basic criteria used in F5. It is important to start from 
F5's point of view to see how optimizations in the signature-based world can be 
achieved. Note that we use the notations introduced in [5] for an easier adoption 
to incremental signature-based algorithms in general later on. 

Lemma 2.5. Assume the computation of a Grobner basis poly(Gi) for Ii, that 
means we are in the ith incremental step o/F5. 
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1. Non-minimal signature criterion (NM): spoly(/, g) has a standard repre- 
sentation w.r.t. Gi if there exists an element lo 6 PSyz(Fi) with lm(w) = 
t ul ej h such that 

(a) t u | u h t h , 

where Uh,sig(h) = Uhth&j h for either h = f or h = g. 

2. Rewritable signature criterion (RW): spoly(/, g) has a standard represen- 
tation w.r.t. Gi if there exists a labeled polynomial r such that 

(a) index(r) = index(/i), 

(b) sig(r) >- sig(h), and 

(c) t r \ Uhth, 

where sig(r) = t r ej h and UhSig(h) = UhthCj h for either h = f or h = g. 

Proof. The two statements of Lemma [2.51 arc included in Theorem 18 of [5]. 
Note that there (NM) is considered only for h being an element generated in the 
current iteration step, that means index(ft,) = i. Reviewing the proof one easily 
sees that the situation of index(/i) < i is just a special case already considered 
by the proof: There the two signatures M/sig(/) and u g sig(g) refer to M. and 
TV, where M. >- TV. In our situation u/jSig(/i) = TV, and any principal syzygy u> 
with the above mentioned properties can only decrease TV. The statement then 
follows by the very same argumentation as in the proof given in [5] . □ 

Remark 2.6. 

1. Note that all signature-based algorithms have in common that they handle 
their s-polynomials by increasing signature. This even holds for F5, also 
there the critical pairs are presorted by the degree of the pairs. Since F5, 
as presented in [51(7], works only with homogeneous input, this does not 
interfere an ordering w.r.t. increasing signatures. By the discussion in [5] 
F5 can also be used for inhomogeneous input by removing the presorting 
of critical pairs by increasing degree. 

2. F5C and G2V implement (NM) and (RW) quite differently. An in-depth 
discussion on this topic is given in [5J. These differences are explained in 
the following, too, inasmuch as they influence the ideas of this paper. 

3. The crucial fact for the optimization presented in this paper is that whereas 
checking (NM) is quite easy and cheap speaking in a computational man- 
ner, searching for possible elements r with which we can check (RW) costs 
many more CPU cycles. 

3 Computational overhead 

One of the main problems of signature-based Grobner basis algorithms is the 
overhead generated by the following kind of data: 

1. From the point of view of the resulting Grobner basis the elements are 
useless, that means the corresponding leading monomials are superfluous. 



■5 



2. For the correctness of the algorithm the very same elements are crucial: 
They are essential for the correct detection of useless critical pairs w.r.t. 
(NM) and (RW). 

Remark 3.1. This characteristic is unique to signature-based algorithms and 
cannot be found in other Buchberger-style Grobner basis algorithms. It does not 
only give a penalty on the performance, but unfortunately also causes problems 
with theoretical aspects, for example regarding the termination of F5, see [3] for 
more details. 

Next we state the pseudo code of the main loop of an incremental signature- 
based Grobner basis algorithm in the vein of F5, denoted SigGB. Note that 
we denote the incrementally called subalgorithm IncSig. 



Algorithm 1 SigGB, an incremental signature-based Grobner basis algorithm 

in the vein of F5 

Input: F m 

Ensure: G, a Grobner basis for I m 
l: Gi<-{(ei,/i)} 
2: for (i = 2, . . . , m) do 
3: fi <- Reduce (f h poly(G<_i)) 
4: if (/< ± 0) then 

5: Gi <- INCSIG (fi, 

6: else 

7: Gi <— Gi-i 

8: return poly(G m ) 



Let us start the discussion on computational overhead looking at F5 as 
presented in [7j first, so the following disussions refers to Algorithm [TJ The first 
drawback is the computation of non-minimal intermediate Grobner bases. 

1. Due to the fact that the signatures of the labeled polynomials must be 
kept valid during the (sig-safe) reductions taking place, some leading term 
reductions do not take place immediately, but arc postponed. These re- 
ductions, needed to ensure correctness of the algorithm, are computed 
when generating new critical pairs later o Thus at the end we could 
have three polynomials poly(/), poly(g), and poly(/i) in poly(Gr,) such 
that 

(a) lm(<7) | lm(/), but the reduction / — ctg has not taken place due to 
tsig(g) >- sig(/), for some c e K., t e M such that lt(/) = ctlt(g). 

(b) h is the result of the later on generated and reduced s-polynomial 
spoly(g, /) = ctg — /, which is sig-safe due to swapping ctg and /. 

In the end, we only need two out of these three elements for a Grobner 
basis; in a minimal Grobner basis we would discard poly(/). The problem 
is that for the correctness of the ongoing incremental step of F5 the labeled 

2 In [7] this kind of generation of new critical pairs is not postponed to the end of the current 
reduction step, but those are added to the pair set in place. These two ways of handling such a 
situation are nearly equivalent and do not trigger any difference for the overall computations, 
see [6]. The way it is described here makes it easier to see how the computational overhead is 
produced. 
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polynomial / as well as its addition to G,; is essentia^: Without adding 
/ to Gi the critical pair (g, /) would not be generated at all, thus the 
element h, possibly needed for the correctness of the Grobner basis in the 
end, would never be computed. So we are not able to remove / during 
the actual iteration step. 

Clearly, in the same vein the problem of non-rcducedncss of the Grobner 
basis poly(Gj), in particular, missing tail-reductions, can be understood. 

2. Since F5, when reducing with elements generated in the ongoing iteration 
step, processes top-reductions only, elements can enter Gi whose poly- 
nomials have tails not reduced w.r.t. poly(Gj). The main argument for 
not doing complete reductions in this situation is the requirement of sig- 
safeness: Comparing the signatures before each possible tail-reduction can 
lead to quite worse timings. On the other hand, from the point of view of 
the resulting Grobner basis poly(Gi), which consists only of polynomial 
data, we do not need to take care of sig-safeness and can tail-reduce the 
elements in poly(Gi) as usual without any preprocessed signature com- 
parison. This is way faster than implementing tail-reductions during the 
iteration step, although we have to use the non-tail-reduced elements dur- 
ing a whole iteration step. 

From the above discussion we get the following situation: 

1. The computational overhead during an iteration step is prerequisite for 
the correctness of F5. 

2. The set of labeled polynomials Gi returned after the ith iteration step is 
used as input for the (i + l)st iteration step, including the signatures. 

In |13j Stegers found a way optimizing at least the reduction steps w.r.t. 
elements of previous iteration steps. There the fact is used that F5 does not 
need to look for the signatures, due to the definition of -< all such reducers 
have a smaller index, and thus, a smaller signature: His variant of F5 computes 
another set of polynomials Bi after each iteration step, namely the reduced 
Grobner basis of Jj which is computed out of poly(Gj). In the following iteration 
step reductions w.r.t. elements computed in previous iteration steps are done 
by Bi, not by poly(Gj). 

In [3] the variant F5C of F5 is presented, which is based on the idea of 
Stegers, but goes way further: F5C is an optimized version of F5 interrcducing 
the intermediate Grobner basis poly(Gi) to Bi and uses these •polynomial data 
as starting point for the next iteration step. At this point we can look at 
Algorithm [1] from [5] , which illustrates one single iteration step of incremental 
signature-based Grobner basis algorithms: Let i = #J3j, any element bj € Bi 
gets a new signature ej, so that we receive elements gj = (ej, bj) in G,_|_i for 1 < 
3 < fi+i is then added to G i+ i by adjusting the index, g l+ i = (e^+i,/,-+i). 
On the one hand, proceeding this way the corresponding signatures of reduced 
polynomials are guaranteed to be correct from the view of the algorithm. On 
the other hand, all previously available criteria for detecting useless critical 

3 See [I] for more details, also on termination issues caused by this behaviour of signature- 
based algorithms. 
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pairs w.r.t. to labeled polynomials of index < I by (RW) in the upcoming 
iteration step are removed. Thus the question, if for the benefit of having less 
computational overhead is payed dearly by less efficient criteria checks in the 
following iteration steps, needs to be asked. 

The idea to get at least some data for using (RW) on elements of previous 
iteration steps in the following can be summarized in this way: 

1. For any two elements gj, g^ £ G^+i with polynomial parts in Bi, spoly {gj, g^) 
reduces to zero w.r.t. Gj+i, that means, it already has a standard repre- 
sentation w.r.t. Gj+i (since Bi is already a reduced Grobner basis). More- 
over, we can quite easily get the corresponding signatures of spoly(<?j, <?&) 
due to the fact that the labels of the generating elements, ej and e^, are 
trivial. 

2. With this in mind, we can create for each index 2 < j < £ a list of 
signatures generated out of sig (spoly (g j, gt)) for k < j. Those signatures 
can be used by (RW) detecting useless critical pairs in the next iteration 
step. 

Clearly, one wants to omit such a rccomputation of signatures during two 
iteration steps as much as possible. Luckily it is shown in [5] that this is not a 
problem at all: 

Proposition 3.2. Let spoly(/, g) be any s-polynomial considered during an 
iteration step of F5 with index(g) < indcx(/). Assume that j^r^ 9 would be 
detected either by (NM) or (RW). Then spoly(/, g) is also discarded in F5C. 

Corollary 3.3. For an s-polynomial in F5C it is enough to check a generator 
f by (NM) resp. (RW) if f was computed during the current iteration step. 

Proof. See Theorem 27 resp. Corollary 28 in [5]. □ 

Thus it follows that we do not need to recompute any signature after in- 
terreducing the intermediate Grobner basis poly(Gj) for checks with (RW). 

Let us add the above ideas in the pseudo code of Algorithm [5J We highlight 
the new step of interrcducing the intermediate Grobner basis, differing from the 
description of Algorithm [TJ There are two main changes: 

1. Gi now denotes already a set of polynomials in 1Z, as the labeled polyno- 
mials are newly generated at the beginning of each incremental step. 

2. Instead of IncSig we use a new algorithm IncSigR which takes a reduced 
Grobner basis -Bi_i as a second argument. Note that for IncSigR we refer 
the reader to Algorithm 1 in [B]. 

Remark 3.4. 

1. Also we do not state proofs for Proposition 13.21 and Corollary 13.31 here, 
but only refer to [5] , it is important to note that correctness of Proposi- 
tion l3.2l is based on the fact that F5C uses (RW) as presented above. G2V 
uses a relaxed variant of (RW), see [5] for more details. Still, G2V uses 
Algorithm [2] as an outer loop for its incremental computations. 
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Algorithm 2 SigGB with reduced intermediate Grobner bases 



Input: F m 

Ensure: G, a Grobner basis for 7, 

1: Gi <- {/i} 

2: for (i = 2, . . . , m) do 

3: <- RedSB (Gi_i) 

4: /i «- REDUCE(/,, 

5: if (/< ^ 0) then 

6: Gi^-lNCSlGRC/i.Bi-i) 

7: else 

8: G t <- G l - 1 

9: return G m 



2. In the following we can consider incremental signature-based Grobner ba- 
sis algorithms in general, i.e., Algorithm |2] can be seen as a wrapper for 
F5C and G2V. 

Having a common basis of our algorithms in question, let us see next how the 
initial presentation of G2V improved the field of signature-based computations. 

4 Using reductions to zero 

As described in [3] G2V can be seen a variant of F5C using a way more relaxed 
version of (RW): G2V only checks if the corresponding s-polynomials of two 
critical pairs have the same signature when adding the pairs to the pair set. In 
this situation only one of these two pairs is kept, the other one is discarded. We 
refer to [0J for more details. 

Thus G2V's efficiency is mainly based on its optimized variant of (NM). The 
idea can be explained quite easily: Whereas F5C uses only principal syzygies 
for (NM), since they are known beforehand and can be precomputcd, G2V goes 
one step further: 

Definition 4.1. During the (i + l)st iteration step of IncSigR we define 

iSi+i := {tef + i G S | tet + i signature of an s-polynomial 
that reduced sig-safe to zero }, 

where I + 1 is the current index of the labeled polynomials. 

Lemma 4.2 (Improved (NM)). Let £ + 1 be the current index of the (i + 
l)st incremental step of G2V computing a Grobner basis poly(Gj+i) for 
spoly(/, g) has a standard representation w.r.t. Gj+i if there exists an element 
u) G PSyz(Bi U {/j+i}) U Sf+i with lm(w) = t w e^_|_i such that t u \ u^th, where 
UhsigQi) = Uhthee+i for either h = f, index(/) = £ + 1 or h = g, index(g) = 
£+1. 

Proof. See Proposition 16 and Lemma 17 in [B]. □ 
Remark 4.3. 
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1. Switching from PSyz^+i) to PSyz (Bi U {/i+i}) is not a problem at all 
since (-Fi+i) = (Bi U {/i+i}) as -Bi is the reduced Grobner basis of ij. 

2. Note that the restriction to check elements of current index only is influ- 
enced by the discussion in Section [31 Whereas we know that Corollarv l3.3l 
ensures that F5C does not lose any useful information for rejecting useless 
critical pairs due to its aggressive implementation of (RW), this does not 
hold for G2V. So for G2V it is possible that removing the signatures of 
the intermediate Grobner bases can lead to the situation that less critical 
pairs are rendered useless by (NM). 

Clearly one can easily use (NM) as given in Lemma 14.21 in F5C instead 
of the plain one stated in Lemma [2.51 its correctness does not depend on the 
implementation of (RW). 

Definition 4.4. We denote the algorithm F5C with (NM) implemented as in 
Lemma jj by F5A0 

The question that comes to one's mind is the following: Why would it be 
of any benefit to switch from F5C to F5A? Whereas some of the signatures of 
zero reductions are used in F5C in (RW), not all of them can be used in each 
situation due to the restriction that sig(r) y sig(/i). It is already mentioned 
in [B] that F5A is way faster than F5C for non-regular input: F5C computes 
way more zero reductions than F5A, but cannot use it actively. Moreover, even 
if a corresponding (RW) detection happens in F5C, testing by (NM) in F5A is a 
lot faster as already discussed in Remark [3] We see in Section [H] that this has a 
huge impact not only on the timings, but also on the number of reduction steps 
taking place, due to the fact that (NM) and (RW) are used in F5C rcsp. F5A 
on possible reducers, too. 

5 Combining ideas 

So what is our contribution in this paper besides giving an overview of already 
known optimizations? Until now, the presented ideas of interreducing interme- 
diate Grobner bases and using zero reductions actively in (NM) are used without 
any direct connection: 

1. The Grobner bases are intcrrcduced between two iteration steps. This has 
an effect on the labeled polynomials computed in the previous iteration 
steps. 

2. Zero reductions arc used actively in a single iteration step only. This has 
an impact on current index labeled polynomials only. 

Of course, interreducing the intermediate Grobner basis poly(Gj) to -B.;has 
an influence on the upcoming iteration step inasmuch as less critical pairs are 
considered and reductions w.r.t. Bi are more efficient. Besides this we cannot 
assume to receive any deeper impact on the (i + l)st iteration step. On the 
other hand, it would be quite nice to use (NM) not only on current index 
labeled polynomials, but also on those coming from Bi. For this one could just 

4 The "A" stands for "actively using zero reductions". 
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precompute PSyz(Bi) and check the corresponding lower index generators of 
critical pairs using PSyz(Sj) in (NM). By Lemma [2.51 this would be a correct 
optimization. The crucial point is that we can do even better: 

At the moment we interreduce the intermediate Grobner basis poly(Gi) of Ii 
let us try to get some more resp. better signatures for checking (NM): We have 
already seen in Section [3] how to recompute some signatures for checking lower 
index labeled polynomials in the upcoming iteration step: We just compute 
and keep the signatures of the corresponding s-polynomials spoly(<7j, g^) for 
poly (<7j), poly (gfe) £ Bi. There we could reject their computations as we have 
proven that F5C would discard any s-polynomial detected with those signatures 
by (RW) anyway. Moreover, we know that all those signatures correspond to 
s-polynomials that already reduce to zero w.r.t. Thus, with the idea of 

Section [5] in mind, we can actively use those theoretical zero reductions actively 
for lower index labeled polynomials. 

Definition 5.1. Assume that poly(Gj) is reduced to Bi = {bi, . . . ,6^} after 
the zth iteration step of SigGB. Then we define 



Theorem 5.2 (Strengthening (NM)). Assuming the (i + l)st incremental step 
of a signature-based algorithm computing a Grobner basis poly(Gj+i) for ij+i, 
let £ + 1 be the current index of labeled polynomials. spoly(/, g) has a standard 
representation w.r.t. G^+i if there exists an element 



with lm(w) = t UJ ej h such that t u \ u^th, where u/ l sig(/i) = uy l t\ l ej h for either 
h = f or h = g. 

Proof. Clearly, the elements uj in PSyz {Bi U {fi+i}) can be divided in two types: 

1. lm(cj) = ie^+i, and 

2. lm(cj) = te 3 for 2 < j < £ + 1. 

Those elements of Type (1) from PSyz (Bi U {fi+i}) together with those of S^+i 
(sec Definition 14. ip are of the type tee+i, and thus do only detect generators of 
the s-polynomial in question which are computed during the current iteration 
step. The correctness of this statement follows from Lemma H. 21 

Next we consider Sj for 2 < j < i: Note that each element in such an Sj 
is of type tei i , that means those elements can only detect generators of the 
s-polynomial in question of index ij . Thus it is enough to consider one such Sj 

for a fixed j. Let te£ j S Sj, then t = ^^-y for some 1 < k < £j. Since Bj 

with #-Bj = (j is a reduced Grobner basis we know that spoly(&£ 3 . , bk) reduces 
to zero w.r.t. Bj. Moreover, note that due to the incremental structure of 
SigGB and the interreduction of poly (Gj) between each two iteration steps the 
sequence (b\, . . ., bij) of elements of Bj can be ordered in such a way that this 
zero reduction corresponds to a syzygy uj with lm(w) = tee - ■ Any other possible 
syzygy detecting a generator of index ij is of Type (2) from PSyz (Bi U 
By Lemma |2~51 (|Tj) the statement holds. □ 




u 6 PSyz (Bi U {fi+i}) U Si+i U S, U . . . U S 2 
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Remark 5.3. 



1. Note that the sets Si in Theorem 15.21 are computed recursively after the 
zth iteration step of SigGB. There is no connection between Si and 
Si-i for any i > 3. It is also crucial to only take the element of highest 
index £j from Bi into account when computing Si , we cannot ensure that 
spoly(t/j ;, <7fc) reduces sig-safe to zero w.r.t. Gi if j £i k. There a 
not sig-safe reduction could be possible to achieve the zero reduction of 
spoly((?j, <7fe), a problem that cannot happen if either j or k is equal to li. 

2. The main optimization compared to F5C is to use S\ up to 5*2 not in 
(RW), as it is described in Section [31 but in (NM). Compared to G2V's 
variant of (NM) lots of new checks are added that detect way more useless 
critical pairs as we see in Section \6\ 

Note that this problem has not been taken into account in [5] where such 
signatures are suggested to be used in (RW). 

One could think that the presented strengthening of (NM) is rather equiv- 
alent to the initial presentation in Lemma 12.51 but this is not the fact. The 
variant presented here is way more aggressive in finding useless critical pairs. 

Corollary 5.4. In Theorem \5.2\ elements of type lm(s) = tej G PSyz(Bi U {fi+i}) 
where j = =fJ=Bi for some 2 < I < i is need not be considered at all. 

Proof. Assume such an s S PSyz (£?,; U {fi+i}) with lm(s) = tej, j = for 
some fixed t. Then s — b^ej — bjek for some k < j. Let k be fixed. In St there 
exists some ue, with u = ^-ttt-. It follows that u 1 1. □ 

The idea is now to replace the implementations of (NM) in F5C, F5A, and 
G2V , respectively, by the version given in Theorem 15.21 

Definition 5.5. We denote the algorithms F5C, F5A, and G2V with (NM) 
implemented as in Theorem by iF5C, iF5A, and iG2V, respectively!! 

Algorithm[3] illustrates the main wrapper for an incremental signature-based 
Grobner basis algorithm based on the idea presented in this section. 

Example 5.6. Let pi = yz + 2, p2 = xy + ^xz + |, = xz 2 — 6x + 2z be 
three polynomials in Q[x,y,z]. We want to compute a Grobner basis for the 
ideal / = (jpi,p2,P3) w.r.t. the degree reverse lexicographical ordering with 
x > y > z using Algorithm [3] using -< as ordering on the signatures. 

Due to the incremental structure of the algorithm we start with the com- 
putation of a Grobner basis Gi of (pi,p2). After initializing f\ := (ei,p\) and 
/2 := (e2,P2) we construct the s-polynomial of f\ and fi is 



which does not have a standard representation w.r.t. G2 at the moment of its 
creation. Nevertheless, a new element is added to G2, namely 



spoly(/ 2 , /1) = (ze 2 , zp 2 - xpi) 




5 The "i" stands for "intermediate incremental optimization". 
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Algorithm 3 SigGB with reduced intermediate Grobner bases and optimized 
(NM) Criterion 



Input: F m 

Ensure: G, a Grobner basis for I, 



1 






2 


for (i = 2, . . . , m) do 




3 


<- RedSB (Gi_ 


i) 


■1 


/i «- REDUCE^, 5; _ 


-i) 


5 
6 


if (/; ^ 0) then 




7 


for (k = l,...,j- 


1) do 


8 






9 


Gi ^- lNCSlGR(/i, 


Bi-i) 


10 


else 




11 






12 


return G m 





Now we look at the s-polynomials 



spoly(/ 3 ,/i) = (yze 2 ,3?/poly(/3) - xzpoly(/i)) , 
spoly(/3,/ 2 ) = (yze 2 ,32/poly(/ 3 ) - z 2 poly(/ 2 )) . 



It does not make any difference which of these two s-polynomials wc compute: 
F5 would remove the later one by its implementation of (RW), whereas G2V 
would only store one of the two corresponding critical pairs in the beginning. 
W.l.o.g. we assume the reduction of spoly(/3, /i), the situation for considering 
spoly(/3, / 2 ) is similar and behaves the very same way: 



Further sig-safc reductions with 6/ 2 and 2/i lead to a zero reduction, i.e. we 
can add a new rule, namely yze 2 , to the set S2 as explained in Section Hl^l 

Since there is no further s-polynomial left, SigGB finishes this iteration step 
with 



We see that G 2 is already the reduced Grobner basis £? 2 of (pi,p 2 ), so Line [3] of 
SigGB does not change anything. Since Gi and Bi do not coincide inbetween 
most iteration steps, we need to remove S2 completely, since rules stored in 
there might not be correct any more. 

Let us have a closer look at how the new rules would be computed: First of 
all, any element in G 2 corresponds to a module generator of 1Z 3 , that means we 
can think of three labeled polynomials fa := (e.;, bi) for i £ {1, 2, 3}. During the 
lines [5] - [S] the signatures corresponding to spoly(/ 3 ,/i) and spoly(/3,/ 2 ) are 
added to 5 2 respectively: The first one gives the rule ye^, the second one also; 
thus we have S2 = {yes}- 

Next we see that reduces to zero w.r.t. _B 2 in Linc[3J 

6 Latest at this point spoly(/3, fa) would be removed by the rule yze2 G 52- 



spoly(/ 3 , /1) = (yze 2 , -6xy - 2xz + 2yz) 
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Thus the algorithm does not enter another iteration step, but terminates 
with the correct Grobner basis G = B^. 

Remark 5.7. 

1. Of course as stated in the pseudo code, SigGB would reduce the next 
generator p% of the input ideal w.r.t. intermediate reduced Grobner basis 
i?2, before it would recompute the syzygy list 5*2 in the above example. 
For the sake of explaining how the recomputation of such a rules list 
works, using a rather small example, we choose a complete discussion over 
efficiency in Example 15.61 

2. Note that it would be wrong to also add the signature of spoly(/2, /i) to 
52 once we have interreduced Gi to B 2 - Of course, spoly (62,61) has a 
standard representation w.r.t. B2, namely 

spoly(6 2 ,6i) = 63, 

but this does not lead to a standard representation of spoly(/2,/i) due 
to the fact that sig(/3) = e 3 y sig (spoly(/2, /i))- That is the one big 
drawback of interreducing the intermediate Grobner bases in incremental 
signature-based algorithms. Nevertheless, the speed up due to handling 
way less elements in Bi compared to Gi more than compensates this as 
shown in [5] . 

6 Experimental results 

We compare timings, the number of zero reductions, and the number of overall 
reduction steps of the different algorithms presented in this paper. To give a 
faithful comparison, we use a further developed version of the implementation 
we have done for [5J: This is an implementation of a generic signature-based 
Grobner basis algorithm in the kernel of a developer version of Singular 3-1-4. 
Based on this version we implemented G2V, iG2V, F5C, iF5C, F5A, and iF5A 
by plugging in the different variants and usages of the criteria (NM) and (RW) . 
There are no optimizations which could prefer any of the specific algorithms, so 
that the difference in the implementation between two of the above mentioned 
algorithms is not more than 300 lines of code; compared to approximately 3,500 
lines of code overall this is negligible. All share the very same data structures 
and use the very same (sig-safe) reduction routines. So the differences shown 
in Tables [T] [2J and [3J really come from the various optimizations of the criteria 
mentioned in Sections [3] - [5) 

Of course, to ensure such an accurate comparison of various different variants 
of signature-based algorithms has a drawback in the overall performance of the 
algorithms. Since we are interested in impact of the improvements explained in 
this paper it is justified to take such an approach. Clearly, implementing a highly 
optimized iF5A whithout any restrictions due to sharing data structures and 
procedures with an G2V can lead to a way better performance. It is not in focus 
of this paper to present the fastest implementation of such kind of algorithms, 
but to present practical benefits of the presented optimizations, focusing on the 
fact that all variants of incremental signature-based Grobner basis algorithms 
take an advantage out of them. 
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best 



worst 



Figure 6.1: Meanings of the colors in the respective tables 

The source code is publicly available in the branch f 5cc0 at 
https : //gitlmb . com/ederc/Sources. 
We computed the examples on a computer with the following specifications: 

• 2.6.31-gentoo-r6 GNU/Linux 64-bit operating system, 

• INTEL® XEON® X5460 @ 3.16GHz processor, 

• 64 GB of RAM, and 

• 120 GB of swap space. 

Due to the fact that we are comparing 6 algorithms we colorized the results 
presented in the respective tables for better readability, see Figure 16.11 

Our naming convention for examples specifies that "-h" denotes the homog- 
enized variant of the corresponding benchmark. All examples are computed 
over a field of characteristic 32, 003 w.r.t. the degree reverse lexicographical 
ordering. 

First of all let us have a closer look at G2V, F5C, and F5A. In Table [JJ we 
see that whereas F5C is faster than G2V in nearly all example sets, the ones 
which lead to a high number of zero reductions in F5C are computed way faster 
by G2V. This is based on the fact that G2V actively uses such zero reductions 
with adding new checks for (NM), whereas F5C only partially includes those 
signatures in its implementation of (RW). 

Comparing F5A and F5C we see that F5A is not only way faster than F5C 
in such highly non-regular examples like Eco-]|^|, but also that F5A is faster 
in other systems like Cyclic-8. Note that F5A is in all examples faster and 
computes less reductions than G2V. 

The ideas of Section [5] help iG2V to compute much loss reduction steps and 
discard way more useless critical pairs than G2V does. This comes from the 
fact that G2V does not implement any rewritable criterion besides its choice 
of keeping only 1 critical pair per signature. In most examples iG2V executes 
only half as much reduction steps as G2V, in some examples like F-855 it even 
alleviates to 15%. 

By our discussion in Section [3] it is not a surprise that there is no change 
in the corresponding numbers of reduction steps and zero reductions for iF5C 
rcsp. iF5A compared to the ones of F5C and F5A. Still the timings improve 
greatly which is based on the following facts: 

1. Altough the rules added to Si in Lines [5] -[5] are also checked by F5's (RW) 
implementation, checking (RW) costs more time than checking (NM) due 
to checking if the one is still in the area of correct (RW) rules. 

7 The results presented here are done with the commit key 
5c4dcll34a4ab630faab994dbe93d3013b4ccc7e. 

8 Note that F5C and iF5C cannot compute Eco-ll-h. 
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Test case 


G2V 


iG2V 


F5C 


iF5C 


F5A 


iF5A 


Cyclic-7-h 


25.714 


13.441 


5.369 


5.255 


5.7S 


5.180 


Cyclic-7 


24.613 


12.782 


5.66 


5.310 


5.216 


4.758 


Cyclic-8-h 


13,273.487 


10.22 


6,881.895 


3,689.003 


4,970.714 


1,907.076 


Cyclic-8 


12,386.925 


B2.3C 


6,606.116 


3,482.177 


4,814.173 


1,812.321 


Eco-9-h 


10.05 


6.741 


58.935 


59.182 


6.320 


5.550 


Eco-9 


18.755 


10.012 


12.899 


12.9S 


13.425 


12.899 


Eco-10-h 


78.66 


158.167 


2,472.763 


1,974.163 


188.683 


133.183 


Eco-10 


1,041.896 


497.025 


491.914 


518.346 


492.345 


97.81 


Eco-ll-h 


39.89 


5, 136.465 


— 


— 


7, 893.788 


4,815.971 


F-744-ti 


33.24 


25.724 


35.324 


36.740 


19.865 


19.865 


F-744 


27.312 


20.469 


8.198 


9.26 


8.458 


8.198 


F-855-h 


46.74 


290.856 


2,948.138 


2,381.813 


600.001 


422.219 


F-855 


971.491 


131.134 


83.185 


B6.34 


86.558 


84.020 


Katsura-10-h 


4.186 


4.193 


4.213 


4.491 


4.24 


4.256 


Katsura-10 


4.150 


4.183 


4.187 


4.217 


4.227 


4.1E 


Katsura-ll-h 


59.0C 


59.871 


58.689 


61.496 


58.411 


58.673 


Katsura-11 


53.894 


53.85 


53.464 


56.122 


53.984 


53.118 


Gonnet-83-h 


12.16 


10.617 


126.173 


25.963 


9.811 


8.761 


Schrans-Troost-h 


4.393 


4.250 


2.970 


3.4S 


3.087 


2.970 



Table 1: Time needed to compute a Grobner basis of the respective test case, 
given in seconds. 



2. To have all possible (RW) rules available, (RW) must be checked directly 
before the reduction step of the corresponding critical pair starts. At this 
point the critical pair can be stored for a long time, using memory and 
making the list of critical pairs longer. In 1F5C resp. 1F5A useless critical 
pairs can be found directly when the algorithm tries to create them. Thus 
they are not kept for a long time, keeping lists shorter, which does not 
only save memory, but also speeds up insertion of other critical pairs to 
the list. 

All three variants implementing the idea of Section [5] are, often multple times, 
faster in all examples, besides Katsura-X; for them we already know that the 
usual implementation of (NM), that means considering the principal syzygies, 
is already optimal. Thus all the ideas presented in this paper only add some 
small overhead in the computations of Katsura-X which does not affect the 
performance in a beneficial way. 



7 Conclusion 

This paper contributes a more efficient usage, generalization, and combination 
of, for specific incremental signature-based algorithms, known improvements. 
Even in situations where it docs not enlarge the number of detected useless 
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Test case 


G2V 


iG2V 


F5C,iF5C 


F5A,iF5A 


Cyclic-7-h 


1,750,989 


625,815 


100,569 


83,880 


Cyclic-7 


1,750,989 


625,815 


100,569 


83,880 


Cyclic-8-h 


113,833,183 


44,663,466 


14,823,873 


3,403,874 


Cyclic-8 


113,833,183 


44,663,466 


14,823,873 


3,403,874 


Eco-9-h 


409,880 


238,841 


1,996,849 


136,842 


Eco-9 


551,837 


310,745 


247,434 


247,434 


Eco-10-h 


3,760,244 


1,996,573 


19,755,560 


1,019,439 


Eco-10 


6,853,713 


3,352,474 


2,384,889 


2,384,889 


Eco-ll-h 


33,562,613 


16,695,766 




7,374,779 


F-744-h 


1,082,448 


693,630 


789,072 


435,869 


F-744 


473,838 


285,402 


179,100 


179,100 


F-855-h 


23,097,574 


4,407,938 


12,294,951 


2,633,666 


F-855 


7,976,163 


1,772,726 


835,718 


835,718 


Katsura-10-h 


18,955 


18,955 


18,343 


18,343 


Katsura-10 


18,955 


18,955 


18,343 


18,343 


Katsura-ll-h 


65,991 


65,991 


63,194 


63,194 


Katsura-11 


65,991 


65,991 


63,194 


63,194 


Gonnet-83-h 


113,609 


93,137 


278,419 


93,137 


Schrans-Troost-h 


19,132 


18,352 


14,010 


14,010 



Table 2: Number of all reduction steps throughout the computations of the 
algorithms. 



Test case 


G2V,iG2V 


F5C,iF5C 


F5A,iF5A 


Cyclic-7-h 


36 


76 


36 


Cyclic-7 


36 


76 


36 


Cyclic-8-h 


244 


1,540 


244 


Cyclic-8 


244 


1,540 


244 


Eco-9-h 


120 


929 


120 


Eco-9 











Eco-10-h 


247 


2,544 


247 


Eco-10 











Eco-ll-h 


502 




502 


F-744-h 


323 


498 


323 


F-744 











F-855-h 


835 


2,829 


835 


F-855 











Katsura-10-h 











Katsura-10 











Katsura-ll-h 











Katsura-11 











Gonnet-83-h 


2,005 


8,129 


2,005 


Schrans-Troost-h 












Table 3: Number of zero reductions computed by the algorithms. 
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critical pairs (F5C, F5A) it gives quite impressive speed-ups using a faster, less 
complex way of detecting useless data. 

Looking explicitly at G2V, the improvement in terms of removing redundant 
critical pairs is astonishing. Due to the fact that G2V lacks a real implementa- 
tion of (RW) the idea presented in Section [5] gives an easy way to add, at least 
partly, the strengths of F5's (RW) implementation to G2V without making the 
algorithm's description more complex. 

We have presented a rather neat improvement that can be added to any 
existing implementation of an incremental signature-based algorithm without 
any bigger effort. Besides having a huge impact on the computations for non- 
regular sequences, it does not slow down the overall efficiency of incremental 
signature-based Grobner basis algorithms for regular input at all. 
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